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Abstract. We present a deterministic algorithm for computing all irreducible factors of degree 
< d of a given bivariate polynomial / £ K[x,y] over an algebraic number field K and their 
multiplicities, whose running time is polynomial in the bit length of the sparse encoding of the 
input and in d . Moreover, we show that the factors over <Q of degree < d which are not 
binomials can also be computed in time polynomial in the sparse length of the input and in d . 



Introduction 

Effective factorization of polynomials, when possible, is an important task in computational algebra 
and number theory. This problem has a long story, going back to I. Newton in 1707, and to 
the astronomer F. von Schubert who in 1793 presented an algorithm for factoring a univariate 
polynomial, later rediscovered and generalized by L. Kroneckcr in 1882. Many other more efficient 
algorithms were designed since then, between the most famous ones we cite |Ber70llZas69] . 
In 1982, A.K. Lenstra, H.W. Lenstra Jr. and L. Lovasz made a fundamental advance by obtaining 
the first deterministic polynomial-time algorithm for factoring a univariate polynomial over the 
rationals |LLL82| . Based on this result and the technique of lattice basis reduction introduced for 
its proof, several new factorization algorithms were obtained (CC82I IT^n84l IKal85l rTar^l 1 1 ,en87| . 
These algorithms succeeded in bringing to polynomial time the problem of factoring univariate and 
multivariate polynomials over algebraic number fields when given by their dense encoding, that is 
the input / is given by the list of all its terms of degree < deg(/) including the zero ones. 

For practical purposes, it is usually more realistic to consider the sparse (or lacunary) encoding of 
a polynomial. In this paper we consider the problem of factoring a bivariate polynomial 

t 

i=l 

given in sparse encoding: by the list (a^, cti, Pi)i<i<t of its non-zero coefficients and corresponding 
exponents. Let £(f) denote the bit length of the sparse encoding of / ; informally speaking this is 
the number of bits needed to spell out the data. We obtain a deterministic algorithm for computing 
the low degree factors of / in time polynomial in £(f) : 

Theorem 1. There is a deterministic algorithm that, given f S 1\x, y] and d > 1 , computes all 
irreducible factors of f in Q[x,y] of degree <d together with their multiplicities, in (d- £(f))°^ 1 ' 
bit operations. 
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More generally, this algorithm applies for factoring bivariate polynomials over number fields (see 
Subsection I3.2|l . 

Let us observe that since the degree of a polynomial can be exponentially big in its sparse length 
(we have deg(/) < and this upper bound is attainable), a direct application of the algorithms 
for factoring dense polynomials would give an exponential complexity. The restriction to bounded 
degree factors is unavoidable: the polynomial / = x p — 1 (p prime) is of sparse length log 2 (p) + 
0(1) but has the dense irreducible factor x v ~ x + • • • + 1 . 

The first result in this direction appeared in 1998, when F. Cucker, P. Koiran and S. Smale showed 
how to find all the integer roots of a univariate polynomial with integer coefficients in polynomial 
time in its sparse encoding, and asked whether one can find in the same time the rational roots 
as well |CKS99| . This question (and more!) was affirmatively answered by H.W. Lenstra Jr. who 
presented an algorithm that — given a number field K and a univariate polynomial / e .fT [re- 
computes all its irreducible factors of degree < d together with their multiplicities, in (d+£(f))°^ 
bit operations Len99b , Thm] . The first and inspiring result in the multivariate setting was obtained 
by E. Kaltofen and P. Koiran KK05, Thm 3] last year, who showed how to compute the linear 
factors of a bivariate polynomial / € Q[x,y] in polynomial time in £(f) . Our result is then 
an extension of Kaltofen-Koiran's, and a full generalization of Lenstra's to the case of bivariate 
polynomials. 

All these algorithms (including ours) are based on a gap principle first applied by Cucker, Koiran 
and Smale. The idea is so strikingly simple and natural that it deserves to be explained. Let 
/ € Z[x] and £ 6 Z be given, how can we test if /(£) =0? Direct evaluation is not feasible, as 
the size of /(£) can be exponentially big in the input size; an important exception to this are the 
easy cases £ = 0, ±1 . Assume that / = J2l=i a t xCli can be split as 

f = r + x u q 

for non-zero polynomials r of degree deg(r) = k and q , where there is a gap between the exponents 
of r and those of q of length 

u-fc>log 2 ||/||a 
(here ||/||i := J2l=i \ a i\ denotes as usual the i 1 -norm of /). 

Except for the cases £ = 0, ±1, this implies that /(£) = if and only q(£) = r(£) = 0: suppose 
this is not the case, namely /(£) = but ^ , then 

KOI < IMIi • \t\ k < ll/lli • m k and |r(0| = |£P ■ |<?(0I > l£P 

so that ||/||i > |£| u-fc > 2 u ~ k , which contradicts the gap assumption! Therefore, to test if / 
vanishes at £ ^ 0, ±1 , one decomposes / into widely spaced short pieces 

/ = X>7< 

i 

and tests if fi (£) = for all i . 

One crucial fact here is that the decomposition is independent of the point £ ; therefore to find 
integer roots it is enough to find the common roots of a set of low degree polynomials. 
The other key ingredient that makes the above argument work is that any integer £ ^ 0,±1 
satisfies a uniform lower bound |£| > 2 ! In order to apply the same idea to ( £ Q, the correct 
generalization of the absolute value is the height, defined as the maximum between numerator and 
denominator. By imitating the argument above, but this time for the usual absolute value and 
all the p-adic ones, we arrive at the same conclusion as a consequence that all rational numbers 
except 0, ±1 have height at least 2 . This is essentially what Lenstra applied in Lcn99b ; more 
generally, he was able to handle in this way other factors besides the linear ones by considering 
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the height of their roots after applying a suitable lower bound for them, namely Dobrowolski's 
theorem DobZ2] m the version of P. Voutier |Vou96| . In |KK05| . the authors succeeded to present 
the first generalization of this gap principle for non-univariate polynomials, more precisely for 
linear factors of bivariate polynomials. 

As in these previous works, the key of our algorithm is a suitable gap theorem. We obtain it 
as a consequence of a lower bound for the height of Zariski dense points lying on a curve due 
to F. Amoroso and S. David ADOOj. as explained in detail in Section [5] This result allows to 
decompose the given polynomial / G Q[x, y] into short pieces; the factors of / are then computed 
as the common factors of this low degree pieces. This strategy works for all factors except the 
trivial x and y and the cyclotomic ones, that is, factors which are a product of binomials (including 
monomials) whose coefficients are roots of the unity. As in the univariate and linear bivariate cases, 
these factors have to be handled separately, see Section 

Since our algorithm operates by reducing to the cases of dense bivariate and sparse univariate 
polynomials, our concern is only to prove that this reduction can be done in polynomial time in 
the sparse encoding. We have not attempted to compute the exponent in the complexity estimate, 
which in principle can be quite big. It is certainly possible to improve it in view of practical 
implementation: in Subsection 13 . 41 we present one idea in this direction, which consists on adapting 
the decomposition of / to the size of the candidate factor. 

As a consequence of the algorithm, we derive that the number of irreducible factors of degree < d 
of / € Q[x,y] counted with multiplicities (different from the trivial factors x or y) is bounded 
by (d ■ ^(Z)) ' 1 ' ■ This is not trivial, as the degree of / can be exponential in £(/) , but in fact 
much better can be said: 

Proposition 2. Let f £ Z[xi, . . . , x n ] and consider the factorization 

f = q-l[p ep 

p 

where q is a cyclotomic polynomial, p € Q[xi, . . . , x n ] runs over all non- cyclotomic irreducible 
factors of f , and e p is the corresponding multiplicity, then 

]Te p < 5 6 • n 3 ■ log 1 1/| | ! • log 3 (8ndeg(/)). 
p 

In particular the total number of non-cyclotomic irreducible factors of any degree of / is poly- 
nomially bounded in terms of the sparse length of / . This fairly unexpected property general- 
izes |Dob79l Thm 2] and is a further consequence of the connection with Diophantine Geometry 
via the theory of heights: the Amoroso-David lower bound together with the theorem of succes- 
sive algebraic minima of S.-W. Zhang Zha95 imply a lower bound for the Mahler measure of a 
non-cyclotomic polynomial, and from this the statement follows easily. 

Moreover, a positive answer to Lehmer's problem would imply in the univariate case, see Subsection 
II. 21 for details, the stronger estimate 

X^c-logll/Hx. 
p 

for some absolute constant c > . This is even more surprising, since it depends on the coefficients 
of / but not on its degree. It would be interesting to determine if it is possible to obtain such a 
bound without assuming Lehmer's conjecture. 
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This should be compared with another result of H.J. Lenstra Jr.: the total number of irreducible 
factors of degree < d of f E Q[x] counted with multiplicities (different from x) is bounded by 

c • t 2 ■ 2 d ■ d ■ \og(2dt) 

where t is the number of non zero terms of / |Len99al Thm 1]. This bound is exponential, but 
independent of the degree and coefficients of / . Based on these two results, it seems natural to 
consider the following generalization of Descartes' rule of signs: is the number of all irreducible 
(and non-cyclotomic maybe?) factors different from x of a i-nomial in Q[x] uniformly bounded 
by some function B(t) depending only on t, and maybe even by t°^' ? 

Trying to get further, one might ask if it is possible to compute in sparse polynomial time the 
absolute factorization of a sparse polynomial, that is the irreducible factors over Q . For the 
univariate case the answer is clearly "no" : a univariate polynomial splits completely as a product 
of linear factors, and this cannot be done in sparse polynomial time. For the bivariate case, it can 
be shown that the computation of binomial factors is equivalent to the factorization of a univariate 
polynomial, so that binomials factors over Q cannot be computed either. 

Here, we show that except for these, we can compute all other irreducible factors over Q of low 
degree, in sparse polynomial time. To give sense to such a statement, we have to specify the 
way algebraic coefficients are handled: a number field K is described by an irreducible monic 
polynomial g — X^oft/" 2 ' 7 ^l z ] sucn that K — Q($) f° r one °f its roots, and this g is given 
in dense representation by the list of all coefficients gj in some specified order, including the zero 
ones. Each irreducible factor p in the output of the algorithm is encoded by giving a number 
field K such that p £ K[x,y] and by the dense list of its coefficients, each coefficient b E K 
being represented by its vector of rational components b := (bo, • ■ ■ , 65-1) with respect to the basis 
(Q J )a<j<8-i ■ 

Theorem 3. There is a deterministic algorithm that, given f £ Q[x,y] and d>l, computes all 
irreducible factors of f in Q[x, y] of degree < d , together with their multiplicities, except for the 
binomial ones, in {d ■ £(f))°^ bit operations. 

This algorithm follows from another suitable gap theorem that we obtain as a consequence of a 
further result of Amoroso and David, a quantitative version of the Bomogolov problem over the 
torus |ADf)3| . Furthermore, we deduce from their result an estimate for the number of non-binomial 
factors of a given / G Q[xi, . . . , x n ] (Proposition 11.4(1 . 

Several interesting questions arose during our work. The most obvious is the extension of these 
algorithms to multivariate polynomials; this seems quite feasible as the necessary lower bounds 
for the height of points in a hypersurface already appeared in the literature [ADOfll IAD03I IPonOll 
IPon()5b| . 

An interesting open problem is the following: the restriction to computing bounded degree factors 
keeps their length under control, giving the possibility of computing them in sparse polynomial 
time. But, what if we look for factors with a fixed number of monomials, can we still find all of 
them in sparse polynomial time? For instance, can we compute all trinomial factors 

p = a lX ai + a 2 x a2 + a 3 x a3 G Q[x] 

of a given / e Q[x] in polynomial time? 

The outline of the paper is as follows. In Section ^ we explain the basics of the height theory for 
points, polynomials and curves, and we prove the upper bounds for the number of factors of a 
sparse polynomial. In Section|3we obtain the gap theorems, as a consequence of the lower bounds 
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for the height of points on curves. In Section[3]we present the algorithms for rational and absolute 
factorization and estimate their theoretical complexity. 
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Throughout this paper Q denotes the field of rational numbers, K a number field, L a finite 
extension of K , Q an algebraic closure of Q and the subset of Q of all roots of the unity. 
We denote by A™ the affine space of n dimensions over Q . For a polynomial p e Q[x\, . . . , x n ] 
we denote by Z(p) C A" the affine hypersurface defined by p . A curve or a variety is assumed to 
be equidimensional; by irreducibility of a variety we understand its geometric irreducibility, that 
is with respect to Q . 

For every rational prime p we denote by \ ■ \ p the p-adic absolute value over Q such that 
\p\p = P^ 1 ■ We also denote the ordinary absolute value over Q by | • |oo or simply by | • | . 
These form a complete set of independent absolute values over Q : we identify the set Mq of 
these absolute values with the set {oo,p; p prime} . More generally, we write Mk for the set of 
absolute values over K extending the absolute values in Mq , and we note by M|? the subset of 
Archimedean absolute values of Mk ■ 

For vq £ Mk we denote by Q„ the completion of Q with respect to the absolute value vq . In 
case v o — oo we have Qoo = R, while in case vq = p is a prime, Q p is the p-adic field. There 
exists a unique extension of vq to an absolute value over the algebraic closure Q„ . For v £ Mk 
we also denote by K v the completion of K with respect to v . If v extends an absolute value 
vq e Mq , then K v is a finite extension of Q„ . We denote a v : K <— > Q v a (not necessarily 
unique) embedding corresponding to v , that is such that \a\ v = \a v (a)\ Vo for every a G K . 

1.1. Height of points and polynomials. In this subsection we introduce the basic definitions 
and properties of the height of points and polynomials that we will use in the sequel. We refer for 
instance to HS00 for a complete treatment. 

The (logarithmic) height h{£) of an algebraic number £ g Q can be defined in terms of its primitive 
integer minimal polynomial 



We have h(£) > , and h(£) = if and only if either £ = or £ S Goo , the subset of Q of all 
roots of 1 (Kronecker's theorem). Besides, for a rational £ = m/n e Q x in reduced expression, we 



Note 



1. Heights 



p 6 (x)=c- Yl (x-a(0) 6Z[i] 



where a runs over all Q-embeddings of K := Q(£) in Q, by the formula 



(1) 
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easily check that = logmax{|m|, n} . Alternatively, the height can be defined via the Mahler 
measure of the minimal polynomial as 

mfo) := / log b e (e 2 "")| du = [K : Q] ■ 
Jo 

this identity is a consequence of Jensen's formula. 

More generally, the height of a point £:= (£1, ...,£„)£ A™ is defined via the Weil formula 

^ ):= L^©1 51 ^ :Ov]lGgmax{l,|£i|„,...,|£ n |„} 
for any number field if containing the coordinates £j . For n = 1 this gives 

and it can be shown that this coincides with the previous definition. With this expression we 
readily verify that for £, t] £ Q we have that h(£ ■ r)) < h(£) + h(t]) and 

- |n| /i(0 for neZ; 

in particular = h(£) and • £) = for any root of unity w € Goo ■ We will be mostly 

interested on points in the plane £ = (£1,62) € A 2 , in that case the formula reduces to 

m = jK^O\ E [^:Q,]logmax{l,|aU,|6U}- 

Now we introduce a few notions for the height of a polynomial that will prove useful in the sequel. 
We will restrict to bivariate polynomials, although it is clear that all this extends to the multivariate 
case. 

For a polynomial / = J2l=i a i x ai y fii € K[x, y] , its absolute value with respect to v € Mk is 

:= max{|oi|„, . . . , \a t \ v }. 

The height of / is then defined as 

M/):= nroi E :Q.]io g (i/u), 

which is invariant by scalar multiplication because of the product formula 

[K v ■ Q«] log(|a|„) = 0, Vae^. 

Therefore is the Weil height of the projective point (ai : • • • : a t ) . This is independent of 

the chosen field K as long as it contains all of the ai 's. 

For a bivariate polynomial with complex coefficients / e C[x, y] we consider the Mahler measure 

m(f) := / [\og\f(e 2 « m ,e 2 ™)\dudv, 
Jo Jo 

and for a polynomial / e K[x,y] with algebraic coefficients we define its (global) Mahler measure 
by the adelic formula 

m Q (/) := Jk^o] E : <w ™m/)) + E : «W l0 § I/I- • 
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We also consider the height associated to the i 1 -norm: 

hi(f) := ( £ [K v : Q v ] tog|K(/)||i + £ [K„ 

For a primitive f € Z[x, y] , these notions give 

= log |/| = logmaxjKI, . . . , \at\}, = log||/||i = Iog(|oi| + • • • + |a t |), m^/) = m(/). 

All these are invariant by scalar multiplication. In general for any / G Q[x, y] write / = c • / for 
some ceQ x and / 6 Z[x, y] the primitive polynomial with integer coefficients associated to / , 
then h(f) = log |/| , hi(f) = log H/Hi and mtff) = m(f) . 

We will use the following comparison between the heights of a given / G K[x,y] , which can be 
directly proven from the definitions: 

(2) h(f),m^(f) < h^f) < h(f) +\og(t). 

1.2. Height of and on plane curves. A plane curve C C A 2 can have some isolated points of 
small height. For instance the line 

Z(x + y - 1) C A 2 

has the points (1,0), (0, 1), ((1 ± %/3)/2, (1 =F \/3)/2) all of whose coordinates are roots of 1 and 
so their height is 0. D. Zagier [Zag93| showed that the height of any other point £ G Z(x + y — 1) 
is bounded from below by a positive constant 

MO > hfa) = o.i9ii 

where £o denotes the largest real root of the polynomial x e —x i — l. Somehow the fact that a curve 
has some torsion points on it does not reflect its general behavior. A more interesting parameter 
is the height of a Zariski dense set of points. This is measured by the essential minimum, which 
for a plane curve C C A 2 is defined as 

M «»(C) : = inf { v > : {£ G C : h(£) < r)} is an infinite set }. 
For instance, thanks to Zagier's result, 

Li css (Z{x + y -1)) > 0.1911. 
This is a particular case of the Bogomolov problem over the torus proved by Zhang Zha95 which 
asserts that for a subvariety of T n := (Q X ) n , the vanishing of the essential minimum is equivalent 
to being torsion. This result, and others we are going to use, are stated for the torus, but T" is 
naturally embedded as an open subset of A" , and since these results depend on Zariski dense sets, 
they can all be translated to A™ . 

For an irreducible plane curve C C A 2 , being torsion is equivalent to say that there exist a, f3 > 
not both zero, and u> G Goo U {0} such that 

either C = Z(x a - toy 13 ) or C = Z(x a y p - to). 

The irreducible curve C is (we should rather say "corresponds to") a translate of a subgroup 
whenever there exists £ G Q such that 

either C = Z{x a - £y ) or C = Z{x a y f3 - £). 

By definition, a general affine plane curve is torsion (resp. translate of a subgroup) if and only if 
all its irreducible components are so. The statement of the Bogomolov problem (now a theorem) 
is that pL ess (C) = if and only if C is torsion. In other words, if C is not of this form, there 
exists a positive constant c(C) > such that 

h{£) > c(C) for all but a finite number of £ G C. 



Q„] log |/|, 
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There is an extension of the notion of Weil height of points to higher-dimensional varieties. This 
notion was first introduced by P. Philippon Phi91J; for an irreducible hypersurface V C A™ 
defined by a polynomial p <E K[x\,... ,x n ] , it coincides with the global Mahler measure of p 
|l)P99llPonf)T1 : 

(3) h(y)=mtfp). 

The distribution of the height of algebraic points in a curve is in close connection with the 
height of the curve itself. The relation is given by the theorem of algebraic successive minima 
of Zhang |Zha95l Thm 5.2 and Lem. 6.5(3)]: 

» CSS (C) < < 2 M CSS (C). 

Actually, Zhang's result is more precise (all successive minima appear, not only the first one which 
is the essential minimum) and more general, as it works for varieties of any dimension and for any 
"reasonable" height function. 

The stated version is sufficient for our application; for a more elementary proof we refer to |I)P99 
§ 6] . It is an open problem to determine if this estimate is optimal for the case of plane curves or 
more generally for hypersurfaces ( it has been shown to be optimal if we allow varieties of higher 
codimension PS04J Thm 5.1]). Thanks to this result, the Bomogolov problem for plane curves can 
be rephrased as h{C) = if and only if C is torsion. Under this form, the conjecture was already 
proven by W. Lawton in 1977 |Law77j . 

For £ e Q X we have that h{£) — if and only if £ S G M ; this is the O-dimensional (easy) case 
of the Bogomolov problem. Lehmer's conjecture gives a lower bound for the height of non-torsion 
points, its statement being that there exists a positive constant c > such that 

HO > ^ ^ for £ i Goo. 



This conjecture has been widely generalized. Here we are only interested in the case of curves: 
Conjecture 1.1. 

(i) Lehmer problem for plane curves: let C C A 2 be an irreducible curve defined over a 
number field K which is not torsion, then there exists a universal c > such that 



H ess (C) > 



c 



[K : Q] deg(C) ' 

(ii) Effective Bogomolov problem for plane curves: let C C A 2 be an irreducible curve which 
is not a translate of a subgroup, then there exists a universal c > such that 

c 



// SS (C) > 



deg(C) ■ 



These two conjecture look similar but they are not. The generalization of Lehmer problem is 
of arithmetic nature since the degree of the number field plays a role, while the quantitative 
Bogomolov problem is of geometric nature since it makes no reference to the field of definition. 
It has been shown that conjecture ll.llT ) is implied by the classical Lehmer problem |Law77| . 
Conjecture On) is |I)P99I Conj. 1.1]. 

Because of the theorem of successive minima, it is equivalent to have lower bounds for the essential 
minimum or for the height, that is the (global) Mahler measure of the defining polynomial of C . 

Nowadays all these results are proved "up to an e " : for the Lehmer problem we will be mainly 
applying the following lower bound due to Amoroso and David |AD00j . in the version of C. Pon- 
treau |Pon05al Prop. IV. 1] who simplified the proof and made all constants explicit: let C C A 2 be 
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a non-torsion curve denned by an irreducible polynomial p G Z[x, y] and set d := deg(C) = deg(p) , 
then 

J_ / loglog(16d) \ 3 



(4) ™^"V log(16.) J 

In the reference this result is stated in terms of h(C) ; you have to look into the proof for the 
version up here. In fact we will be using the version over a number field: 

Corollary 1.2. Let C G A 2 be a curve defined by an irreducible polynomial p G K[x,y] which is 
not of the form p = Y[i( xa — ^iy^ 3 ) n or p — Yiii^V 13 ~~ w i) f or some a, (3 > not both zero and 
u>i G Goo U {0} and set d := deg(C) = deg(p) , then 

M e SS((7)> 1 . floglog(16[K:Q}dy ; 



5 6 [K : Q]d V log(16[if 
This follows immediately from J3J by considering the norm N(p) := J^J a(p) G Q[x, y] . 

For the effective Bogomolov problem we use another result of Amoroso and David: let C C A 2 be 
a curve which is not a translate of a subgroup and d := deg(C) = deg(p) , then [AD03I Thm 1.5]: 

(loglog(d + 2)) 4 



(5) M ess (G) > 



2 70 d (log(d + 2)) 5 



1.3. On the number of factors of a sparse polynomial. General lower bounds for the Mahler 
measure yield immediately upper bounds for the number of factors of a given polynomial. To our 
knowledge, this observation appears for the first time in the work of E. Dobrowolski Dob79 . Here 
we treat the general n -dimensional case. The notions and results of the previous subsection extend 
to hypersurfaces. We will state them but instead refer the interested reader to the literature for 
n> 3. 

We recall that a polynomial is cyclotomic if it is a product of binomials (including monomials) 
whose coefficients are roots of the unity. 

Proposition 1.3. Let f G K[x\, . . . ,x n ] and consider the factorization 

f = q-]Jp ep 

v 

where q is cyclotomic, p G K\x\, . . . ,x n ] runs over all non- cyclotomic irreducible factors of f , 
and e p is the corresponding multiplicity, then 

e P < 5 6 • n 3 ■ [K : Q] • h,(f) ■ log 3 (8n[X : Q] deg(/)). 

p 

Proof. We have that m^q) = as q is cyclotomic an so 

p 

For each non-cyclotomic factor p£ Q[si, ■ ■ ■ , x n ] we minorate the Mahler measure by the Amoroso- 
David's lower bound in the version of Pontreau |Pon01l Thm 1.6] (see the estimate Q above for 
the case n = 2 ), from which we derive that if V C A" is an hypersurface defined by an irreducible 
polynomial over K , then 

[K : Q] ■ h(V) > 1 flog("log(8n[tf:Q]deg00r " 



5 6 -n 3 V log(8n[if : Q] deg(V)) 
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Therefore, by Identity J3J), we have 

\K : Ql • m„(p) > 5 — > 5 — - , 

" 5 6 • n 3 • log 3 (8n[if : Q] deg(p)) ~ 5 6 • n 3 • log 3 (8n[if : Q] deg(/)) 

which implies 

[if : Q] • > 5 — Ve p 

" 56 • n 3 • log 3 (8n[X : Q] deg(/)) ^ P 

from where we deduce our result. □ 

This is a generalization to n > 2 of Dob79 Thm 2]. As said, a positive answer to the classical 
Lehmer problem would imply a positive lower bound for the Mahler measure of an arbitrary non- 
cyclotomic polynomial p G K [x±, . . . , x n ] , of the form 

for some universal constant c > , namely Coniecture lLHT l. Applying this to the argument above, 
the previous proposition would improve to 

(6) ^ e p < c" 1 • [if :Q] •/!!(/). 

p 

In a similar way, we can produce an upper bound for the number of non-binomial irreducible factors 
over Q : 

Proposition 1.4. Let f £ Q[a;i, ■ ■ ■ , x n ] and consider the factorization 

p 

were q is a product of binomials, p € . . . ,x n ] runs over all non-binomial irreducible factors 

of f , and e p is the corresponding multiplicity, then 

5> p < 10 14 • n 8 ■ h x {f) ■ log 5 (max{16,7ideg(/)}). 
p 

Proof. We have that 

e P m ®(p) = m q(f) < h i(f)> 

p 

apply the Amoroso-David quantitative Bogomolov problem in the version of Pontreau |Pon05bl 
Thm 1.5] (or (JSJ above for the case n = 2). □ 

Similarly, a positive answer to the effective Bogomolov problem fConiecturc ll.lf ii)) would imply 
that 

/J e P < c~ x ■ hi(f) for a universal constant c > 0. 

p 

2. Gap theorems 

By a gap theorem, following |CKS991 ILen99bl IKK05| , we understand a statement asserting that 
for a polynomial / decomposed as 

f = r + s 

for non-zero polynomials r and s , then / has a given property if and only r and s have it, 
provided that r and s are sufficiently separated. We introduce some notation: 

Definition 2.1. For p E Q[x,y] such that deg y (p) > 1 we set 

X(p) := inf {ij > : {(ui, v) 6 x Q : p{lo, v) — 0, h(v) < rj} is an infinite set }. 
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Since deg y (p) > 1 , for all but a finite number of u e there exists some v e Q such that 
p(uj, v) — and so X(p) is well-defined and non-negative. 

In what follows we deal with irreducible polynomials, that are defined up to a scalar factor. For 
simplicity we always refer to one (obvious) representant in each class of associate irreducible poly- 
nomials. 

The following is the main result of this section: 

Theorem 2.2. Let f,r,q£ Q[x,y] be such that f = r + y u ■ q . Let also be given an irreducible 
polynomial p € Q[x,y] , p ^ y , such that deg y (p) > 1 , and suppose that 

(u-deg y (r))-X(p)>h 1 (f), 

then p divides f if and only if it divides r and q . 

For its proof we need the following lemma: 

Lemma 2.3. Let f,r,q € Q[£,y] be such that f = r + y u ■ q . Let also be given u) G G^ and 

v G Q X be such that f(ui,v) = but g(o>,^) ^ 0, then there exists a constant 5(f) > not 
depending on (ui, v) such that 

( W -deg>))-/^)<M/)-<K/)- 

Proof. Let K he & number field containing the coefficients of / , ui and v , and set k := deg y (r) . 
For each absolute value v e Mk we have two cases: 

• \v\ v < 1 : since \u\ v = 1 we have that 

' IK(?)lli fovveM™, 
\q\v for v (£ M%>. 

\v\ v > 1 : using that f(uj, v) = r(oj, v) + v u q(u 1 v) = wc infer that 

' MMKMIIi fovveMg, 

W\v ■ \r\ v for v M£?. 

As both r and q are non-zero, ||fr„(g)||i, ||cr„(r)||i < ||c„(/)||i and so 

log \\a v (q)\\ u log||a„(r)||i <log||a„(/)||i-J(/) 
for some 6(f) > depending only on / . The previous inequalities imply that 

' log | |*„(/)||i -<*(/) for.eM^ 



\q(w,v)\ v < 



\v\% ■ \q(u,v)\ v = \r{w,v)\ v < 



(u - k) logmax{l, \v\ v ] + log \q(oj, v)\ v < 



log fovv^Mg. 



By summing up over all absolute values, using the product formula and the definition of the height, 
one obtains that 

(u - k) ■ h(v) = 1 E l K » : Q"] (( u ~ k ) logmax{l, W\v) + log \q(u, v)\ v ) 
t ' ^ veM K 



< 



t^t^t E [K v :Q v }(\og\\a v (f)\\i-Kf)) + E = Qv] log 



= h 1 (f)-S(f). 

□ 
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Proof of Theorem \2.S\ The " -<= " is trivial, so we show the other implication. 
Suppose that p \ f but p \ q . From the fact that p is irreducible we have that the set of common 
roots of p and q is finite. Also, since deg y (p) > 1 and p ^ y, the set {(u>,v) G Goo x Q x : 
p(ui, v) — 0} is infinite. Given e > , it follows from the definition of X(p) that the set {(uj, v) G 
Goo x Q : p(w, v) — 0, h(v) < \(p) — e} is finite. Therefore there exist an infinite number of 
(uj, v) G Goo x Q X such that p(uj, v) — and h(v) > X(p) — e , and there still exist some uj G Goo 
and i/£Q X such that 

p(ui, v) — 0, q(u, v) ^ and h{v) > X(p) — e. 

Applying Lemma \l. \\\ 

(u - k) (A(p) - e) < (u - A) h{v) < hi{f) - 6(f). 

Since this holds for all e > , we infer 

(u-k)X(p)<hx(f)-8(f)<hi(f) 

because 6(f) does not depend on (lo, v) and so does not depend on e either. This contradicts 
the hypothesis: (u — k) X(p) > hi(f) . Therefore p \ q and p\ — y u ■ q — r as wanted. □ 

Of course this result is only useful whenever X(p) > . What happens is that this parameter is 
bounded from below by the essential minimum, and so all existing estimations for the essential 
minimum will give us a corresponding gap theorem. 

Lemma 2.4. Let p be an irreducible polynomial in K[x,y] such that deg y (p) > 1 . Then 

X(p) > v css (z(p)). 

Proof. Observe that h(v) — h(u>, v) ; we can then rephrase the definition of X(p) as 

X(p) = inf {t) > : {£, e Z(p) n (Goo x Q) : h(£) < T)} is an infinite set }. 

Compare with the definition of the essential minimum: 

H css (Z(p)) = inf {r) > : G Z(p) : h(£) < r,} is an infinite set }, 

so that X(p) is the infimum over a subset of the set used to define /i css (Z(p)) and the inequality 
is clear. □ 

Equality in Lemma 12.41 above does not necessarily hold: consider p :— x a — ^y^ , then for any 
(uj, v) £ Goo x Q we have that p(ui, v) = = and so 

_ h(yP) _ h(io a /0 _ h(£) 

W ~ P ~ P ~ P " 

Hence 

X(p(x, y)) = h(0/P while X(p(y, x)) = h(0/a, 
in particular A depends on the order of the variables, while of course the essential minimum does 
not, so there cannot coincide in general. One can prove that n css (p) = h(£)/ max{a, /?} [pS04 
Prop. 5.4]. 

From Corollary If .21 we deduce: 

Corollary 2.5. Let f,r,q G K[x,y] be such that f = r + y u ■ q . Let also be given an irreducible 
p G K[x, y] that is not of the form p = Yii( xa ~~ ^iU^) nor P — Yiii^V 13 ~ w ») f or some a : P ^ 
not both zero and G Goo U {0} , and set d :— deg(p) . Suppose that 

u - deg y (r) >5 6 -[K:Q]-d- log 3 (f6[if : Q]d) ■ ht(f), 

then p divides f if and only if it divides r and q . 



FACTORING BIVARIATE SPARSE (LACUNARY) POLYNOMIALS 



13 



Similarly we obtain the following gap theorem from the lower bound J5|: 

Corollary 2.6. Let f,r,q£ Q[x, y] be such that f = r + y u ■ q . Let also be given an irreducible 
p G Q[x, y] which is not a binomial, and set d := deg(p) . Suppose that 

u - deg y (r) >2 7a -d- \og 5 (d + 2) • h t (f), 

then p divides f if and only if it divides r and q . 

3. Computing the low degree factors of sparse polynomials 

The goal of this section is to present the rational and absolute factorization algorithms for sparse 
bivariate polynomials. Our conventions about encoding are the usual ones, the same as in for 
instance |Len99bj . The number of bits needed to write down an integer a 6 Z is [log 2 (°)J + 1 f° r 
the digits and 1 more for the sign. For a rational a = m/n € Q in reduced expression, we define 
its bit length as 

1(a) = t(m) + i(n) - 2 = [\og 2 \m\\ + [\og 2 (n)\ + 2; 
the somewhat artificial "—2" is there just to make this coincide with the previous notation for an 
integer a. The sparse encoding of / = J2l=i a iX ai y l3i S Q[x,y] is the list (a,, Oi,/Jj)i<t<t of its 
(non-zero) coefficients and corresponding exponents, and so its bit length is 

t 

(7) *(/) := £ (t(oi) + Llog 2 (a,)J + Uog 2 (A)J + 2) ; 

i=l 

observe that £(f) is an upper bound for t, log 2 (deg/) and h(f) , and in fact is polynomially 
equivalent to these quantities: 1(f) = (t ■ log 2 (deg/) • h(f))°^ . 

For encoding polynomials over number fields we have to say how number fields and algebraic 
numbers are handled: a number field K of degree S = [K : Q] is described by an irreducible 
monic polynomial g = Y2j=o 9j 2 ^ *= ^\ z \ such that K — Q(9) for one of its roots, and this g is 
given in dense representation by the (ordered) list of all its coefficients gj including the zero ones. 
The length of this description is 

5-1 

t{K) :=J2 £ (9j); 
j=o 

in particular t(K) > [K : Q],h(g). An element b E K is represented by its vector of rational 
components (bo, . . . , b$^\) with respect to the basis )o<j<<5— 1 • It can be shown by (you need 
some estimate between the height of an algebraic integer and that of its minimal polynomial) that 

Kb) < t K (b) + [K : Q](h(g) + [K:Q] log(2)) = (£(K) + £ K (b))° w . 

A sparsely given polynomial / = Y^i=i o,iX oli y^ i € K[x,y\ is then encoded by the list of its 
(non-zero) coefficients and corresponding exponents, and its length relative to K is 

t 

M/) := Y^Vxioi) + Ufa) + m))- 

i=l 

Note that the input data is specified by / and K , and so the input length is l(K) + in(f) ■ We 
have that 

t, log 2 (deg /) < £(f) and h(f) < £ K {f) + [K : ®}(h(g) + [K:Q] log(2)) = (l(K) + £ K (f))° {1) ■ 

When the input of our algorithms comprises an inclusion K L of number fields, L is described 
as an extension of K by a monic irreducible polynomial k(z) € Ok[z] such that L = K(d) for 
a root 1? of k; this polynomial is represented in a dense way. A polynomial p € L[x,y] in the 
output is then encoded by the (dense) list of its coefficients with respect to the product basis 
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(# 5, # fe )o< i 7<5-i,o<fc<7-i °f L over Q; here we set 7 := [L : K] . Note that for an element b G K 
in the base field encoded as b = b + ■ ■ ■ + bs-ix 5 ^ 1 with respect to the given basis of K over Q , 
its encoding with respect to the product base will be the same and so 

l L (b)<[L:K]l K (b) 

since we have to count the zero coefficients corresponding to the monomials 9^d k with k > 1 . In 
particular £ L (f) < [L : K] £ K (f) for / G K[x, y] . 

For the absolute factorization algorithm for / G K [x,y] , the output irreducible polynomials pi G 
Q[x,y] are encoded by (L i} pi) , where Li consists in the minimal extension of K such that 
Pi G Li[x,y] (we observe that this encodes a full set {o'(pi)) (T .K<-*Q of [Li : K] conjugate factors 
of /). The couple (Li,pi) is encoded by a monic irreducible polynomial h(z) G Ok[z\ such that 
Li = K[z]/{ki{z)) , and pi is given by its coefficients. 

3.1. Binomial factors. The computation of the irreducible factors of a bivariate polynomial that 
are binomials of more generally products of binomials can be reduced to the univariate case as we 
show in this section. We first observe that an irreducible polynomial p G K[x,y] is a product of 
binomials if it has one of the following forms: 

(8) p(x,y) = H(x a -*(Oy P ) or p(x, y ) = H(x a j/ -a(0), 

cr (7 

where a, [3 > are not simultaneously £ G Q and where a : K(£) ^> Q runs over all 
if -embeddings of K(£) in Q. 
We have the following results: 

Lemma 3.1. Let a, (3,n € N , £ G Q* and f G be given. Set z for a new variable and 

denote by g G Q[x,y, z] the remainder of the division with respect to the variable x of f(x,y) by 
the monic polynomial x a — zy 13 . Then 

(x a -^y f3 ) n \f(x,y) (z-O n \g(x,y,z). 

Proof. Consider the ring 

A:=Q[x,y ±1 ,z]/(x a -zy' 3 ). 

We have that x a — ^y 13 — (z — £,)y^ m A , and, since y is invertible, we have the following equality 
of ideals 

= ((*-O n )inA 

We call this ideal / . By definition / = g in A and so f E I if and only if g E I , that is 

(x a ~^y f3 r\f(x,y) in A ^ (z - $ n \ g(x,y, z) in A. 

We have to show that we can take out the words "in A" from the above statement. 
We observe that there is a natural identification A = Q[x, y ±1 ] . Therefore, 

(x a -Cy P ) n \f in A (x a -^y^ n \fmQ[x,y ±1 ] ^ (x Q - / in y] 

since y is prime to x a — £ y@ . 
We have a second identification 

a-l 

A = @®[y ± \z}-x3, 

and therefore 

(z-£) n \g in A (z-£) n \g in Qfay+^z] (z-£) n \g in Q[x,y,z] 

since y is prime to z — £ . □ 
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Corollary 3.2. With the same notations than in the previous lemma, let K be a number field 
and suppose that f £ K[x,y] . Set 

p(x,y) :=H(x a - a^y?) e K[x,y] and q(z) :=*[[{z -*({)) e K[z] 

where a runs over all K -embeddings of K{^) in Q, then 

P(x,y) n \f(x,y) q(z) n \g(x,y,z). 

Proof. The polynomials x a — o~(£)yP for different a 's are relatively prime, and the same is true 
for the polynomials z — er(£) . Hence p(x,y) n | f(x,y) if and only if (x a — d(£)y /3 ) n | f(x,y) for 
all a if and only if (z — er(£))™ | g(x, y, z) for all a if and only if q(z) n \ g(x, y,z) . □ 

The algorithm to compute the irreducible factors of / S K[x,y] , of degree bounded by d, that 
are product of binomials is now clear: 

We are looking for factors p{x,y) £ K[x,y] of degree < d of one of the forms in (JSJ). The cases 
£ = 0, a = or /3 = reduce directly to the univariate case where we apply Lenstra's algorithm 
|Len99bl Thm] to the corresponding content of / . 

So we can restrict ourselves to the cases when £ £ Q* and a, (3 £ N . We consider first the factors 
of the first form in 

We fix 1 < a, [3 < d, and we set g :— g a jj £ K[x,y,z] for the remainder of dividing / (with 
respect to x ) by x a — zy^ ( g depends only on / and a, (3 ). It is easy to compute g by Euclidean 
division: 

t 

g(x,y,z)=J2 ^ m ° d a (z /) [Qi/qJ / l . 

i=l 

so that g is as sparse as / . We write 

g(x,y,z) = ^2g hj (z)x*y 3 

and observe that an irreducible factor q £ K[z] satisfies q n \g -4=4> q n \g%,j for all i,j , where 
there are at most t non-zero polynomials g^ j , and each of them is as sparse as / , with coefficients 
obtained as the sum of at most t coefficients of / . 

We compute all irreducible factors q £ K[z] of g of degree bounded by d/max{a,/3} and their 
corresponding multiplicities, by examining the common irreducible factors (and their multiplicities) 
of all the gij 's. This is done again applying Lenstra's univariate algorithm. 

Since the irreducible polynomial q is of the form q = Y[ a ( z — <7 (0) > the corresponding irreducible 
factor p of / is then derived as 

p(x,y) = (yP) dc ^q(x a y- P ), 
where deg(p) = max{a,/3} • deg(g) < d. 

Corollary 13.21 certifies that for given a, (3 , we obtain in this way all irreducible factors of / of 
degree < d of the first form in JHJ, as well as their multiplicities. 

For the factors in (JHJ of the second form, we proceed similarly, by considering the remainder 
g £ K[x, y , z] of dividing / (with respect to x ) by x a y /3 — z . We observe that the corresponding 
extensions of Lemma lXTI and Corollarv l3 . 21 hold . In this case, p is derived from the factor q £ K[z] 
of g as p(x,y) = q(x a y /3 ) . 

The algorithm described above yields the following result: 

Theorem 3.3. There is a deterministic algorithm that, given f £ K[x,y] and d>l, computes 
all irreducible factors of f in K[x, y] of degree < d which are products of binomials, together with 
their multiplicities, in (d ■ (£ (K) + iK(f)))°^ bit operations. 
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Proof. We have already established that the previous algorithm gives these factors and their mul- 
tiplicities. Its running time is estimated as follows: for each pair a, /3 , we are applying Lenstra's 
algorithm < t times to the polynomials g^j of sparse length ((gi.j) — 0(£(f)) , in order to com- 
pute their irreducible factors of degree < dj max{a,/3} and their multiplicities. This task is done 

in [d ■ (l(K) + £#■(/))) bit operations. Since there are at most d 2 pairs a,/3, the total bit 
cost of the algorithm remains of order (d ■ (£(K) +M/)))° (1) ■ □ 

3.2. Rational factorization. The search of all the low degree factors of a sparse / € K[x,y] is 
done by decomposing it as a sum of short pieces, as in the previous papers [CKS991 fLen99b K K05) . 
For given A x , A y > , these pieces have to be separated by a distance ("gap") of at least A x in 
the x -direction or A y in the y -direction. This is done here by decomposing / first with respect 
to the y -exponents, then with respect to the a; -exponents. 

Let / = J2t=i diX ai y^ i and suppose that the monomials are already ordered so that /3i < /?2 < 
• • • < (3t ■ Then we determine 

e ■= o < h < ■ ■ ■ < £ s < 4+1 = t 

subject to the conditions 

ft+i - ^ < A v for lj + 1 < i < 4+1,0 < j < s, and - p t] > A y for 1 < j < s, 

namely we split the y -exponents (3\ , . . . , (3t into subsets so that consecutive exponents in the same 
subset are at distance < A y and between different subsets there is a gap of length > A y . Set 

Tj := ^2 a.x^y^' 131 ^ 1 for < j < s so that / = y A °+i ro + y t3 ^+ 1 r 1 H h)/ ft ' +1 r s . 

i=i 3 +i 

Next we do the same procedure over each rj with respect to A x : first we reorder the monomials 
applying a permutation r so that 

tj+i 

r i = a T ^x a ^y 0r ^~^ +1 

and a T u. + i) < a T ( £ , +2 ) < •■■ < a T a- +1 ) ■ Then for each < j < s we sub-split this set of 
— £j exponents into subsets such that the consecutive x -exponents in the same subset are 
at distance < A x , and between different subsets there is a gap of length > A x . Using this, we 
decompose rj into pieces 

Tj = X^ OJ ro,j + • • ■ + X^i^Tt-j 

for some exponents {Ci,j ■ < j < s,0 < i < tj} C {a\, . . . , at} that we do not explicit to avoid 
useless proliferation of indexes. 

Each Ti_j is (up to a monomial) some part of rj , which in time is (up to a monomial) some part 
of / . We arrive in this way to a list of k < t non-zero polynomials fi,...,fk (after rewriting the 
r iy j 's into fi 's) such that 

(9) / = x" /1 y Sl f 1 +x' t2 y S2 f2 H h x^ y 5 * f k ; 

and by construction for 1 < i < k , 

£ K (fi) < l K (f), deg x (f t ) < (t - 1) A x , deg y (f t ) < (t - 1) A, 
and for i ^ j we have that 

either jj - 7* - deg^/j) > A x or 7l - 7j ■- degjfj) > A x 
or Sj - 5i - degy(fi) > A y or 5i — Sj — deg x (/j) > A y . 
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We have decomposed / in < t pieces of controlled degree and separated by a gap of length > A x 
in the x -direction or > A y in the y -direction. 

The computation of the irreducible factors of / of degree < d is then clear. First we compute a 
constant c such that hi(f) < c in time (£(K) + £ K (f))° {1) , as in |Len99bl Prop.3.6]. We set 

A x := A y := A = 5 6 • [K : Q] • d ■ log 3 (16[/iT : Q]d) ■ c. 

Applying Corollary EH we infer that for / = x^y Sl fx + x^ 2 y &2 f 2 + h x lk y 5k f k as in ©, then 

for p £ K[x,y] that is not a cyclotomic polynomial, we have 

P I / P I fi for all i. 

The procedure consists on computing first the cyclotomic factors together with their multiplicity, 
by using the algorithm in Subsection 13. II For the other factors, we compute them as the common 
factors of the fi 's, by using any polynomial-time algorithm for factoring dense bivariate poly- 
nomials over a number field, for instance |Len87l Thm 3.26]. Therefore we obtain the following 
result: 

Theorem 3.4. There is a deterministic algorithm that, given f £ K[x,y] and d>l, computes 
all irreducible factors of f in K[x,y] of degree < d, together with their multiplicities, in 

(d ■ (£(K) + M/)))° (1) bit operations. 

Proof. We have already established that the previous algorithm gives all these factors and their 
multiplicities. We estimate its running time. We show that the degree of fi for all i , 1 < i < k , 
in the decomposition is polynomial in the input size. This is a consequence of our estimate for 
the gap length: 

l(fi) < t(f) and deg^/O.deg^/O < (t-l)A = 0(t-([K : QH) 1+£ -c) = (d-(£(K)+£ K (f)))° {1) . 
Then we apply to each fi a polynomial-time algorithm for factoring dense bivariate polynomials 
over K , which would do the task in [d- (£{K) + £K{f)))°^ bit operations. Since the number of 
fi 's is at most t < £ , the total complexity remains of the same order. □ 

If for an input polynomial / G K [x, y] we are interested in its factors in an extension L , we can 
compute them by just including / into L[x,y] and then applying the above algorithm over L; 

its cost would be of (d ■ (£(K) + £ K (f) + £ K {L))° (1) bit operations. 

We note that here, for the factors which are products of binomials but not cyclotomic, we have 
the choice of computing them either by reduction to the univariate sparse case of Theorem 13. 31 or 
by reduction to the dense bivariate case. 

3.3. Absolute factorization. Given a polynomial / € K[x, y], we can apply Corollary 12.61 to 
extend the previous algorithm to the computation of all irreducible factors of / over Q , of degree 
bounded by d, except the binomial ones. We assume that the input / is encoded in K[x,y] and 
as before we compute a constant c such that hi(f) < c in time (£(K) + £K(f))°^ , then we set 

A,, := A y := A = 2 70 • d ■ log 5 (d + 2) • c. 

Corollarv l2 . 61 implies that for the associated decomposition / = x 7l y Sl fi+x l2 y S2 + - ■ ■ + x lk y Sk fk 
as in ©, any irreducible p £ Q[x,y] that is not of the form 

P(x, y) = x a - £,y or p(x,y) = x a y p - £, 

satisfies 

P I / P I fi for all i. 

Now we need to determine the common factors of the fi 's over Q[x, y] and their multiplicity. In 
order to do this, we first factor completely each of the fi over K[x,y] by applying any dense 
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polynomial-time bivariate factorization algorithm over K . An irreducible factor p € Q[x, y] of / 
will necessarily divide a common irreducible factor q € K[x,y] of all the fi 's. Thus it is enough 
to keep all common irreducible factors q £ K[x, y] of all the fi 's and their multiplicities, and then 
to factor them in y] by applying any polynomial-time algorithm for factoring dense bivariate 
polynomials over Q, for instance |Kal95l Theorem 11]. We only keep those factors in the output 
which are of degree < d and which are not binomials. We proceed in this way in order to avoid 
comparing irreducible factors in Q[x, y] of different fi 's, that can, although equal, be described 
in different field extensions. 

Theorem 3.5. There is a deterministic algorithm that, given f £ K[x,y] and d>l, computes 
all irreducible factors of f in Q[ar, y] of degree < d, together with their multiplicities, except for 

the binomial ones, in (d ■ (£{K) + M/)))° (1) bit operations. 

Proof. As with the previous one, the complexity of this algorithm is estimated in (d ■ {£{K) + 

(■K{f)])°^ bit operations, because we have to factor <t polynomials /, of degree polynomially 
bounded in the input length to find all possible q , which are of input length £k{q) — (d ■ (£(K) + 
^K(f)))° and at most the same quantity, and then to factor them mQ[x,y}. □ 

3.4. A practical improvement: adaptive gap methods. The practical efficiency of the pro- 
posed algorithms depends essentially on the length A defining the gap in / : the degree of the 
pieces fi depends on A , and if this degree is large, the dense factorization algorithm will be clearly 
slower. In other words, the smaller the gap length A is, the faster the algorithm works. Since the 
gap is proportional to the inverse of the essential minimum, the greatest the essential minimum, 
the faster the algorithm. 

There are some special situations where we can get better bounds, for instance for linear factors 
p(x, y) = ax + by + c with integer coefficients, as in KK05 . 

The Mahler measure of a polynomial is bounded from below by the Mahler measure of any of its 
facet polynomials. Hence for a,b,c £ Z relatively prime numbers such that a ■ b ■ c ^ , we have 
that 

m(ax + by + c) > max{m(aa; + by), m(by + c), m(ax + c)} = logmax{|a|, \b\, \c\} 

as it can be proved that the Mahler measure of a binomial coincides with its height. The theorem 
of successive minima then implies 

V em (Z(ax + by + c)) > ilogmax{|a|,|6|,|c|}= ~h(p). 

The only case for which this lower bound is meaningless is when a,b,c — 0, ±1 . (When a , b or c 
vanish, we reduce easily to the univariate case so we do not consider it here.) When a,b,c — ±1 , 
Zagier's theorem Zag93 , see also Subsection ll.2l shows that h(£) > 0.1911 . Hence 



H css {Z(ax + by + c)) > 



log(f ) = 0.1911 ifa,6,c = ±l 

h(p) > = 0.3465 otherwise. 



which improves the bound log(1.045) w 0.0440 proposed in KKQ^j. 

Note that in this case the gap size associated with p — ax + by + c gets smaller as the coefficients 
of p tend to infinity. Therefore, a good strategy to make the algorithm more efficient might be 
to exclude a finite number of candidates by testing them as factors of / (using a rough estimate 
for their gap length), and then use a much smaller gap length to find the rest of the factors by 
reduction to the dense case. 
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